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ABSTRACT 

We show how redshift-space distortions of the galaxy correlation function or power 
spectrum can constrain the matter density parameter Qm and the linear matter fluctu- 
ation amplitude ■ We improve on previous treatments by adopting a fully non- linear 
description of galaxy clustering and bias, which allows us to achieve the accuracy de- 
manded by larger galaxy redshift surveys and to break parameter degeneracies by 
combining large-scale and small-scale distortions. Given an observationally motivated 
choice of the initial power spectrum shape, we consider different combinations of flm 
and (Tg and find paramters of the galaxy halo occupation distribution (HOD) that 
yield nearly identical galaxy correlation functions in real space. We use these HOD 
parameters to populate the dark matter halos of large N-body simulations, from which 
we measure redshift-space distortions on small and large scales. We include a velocity 
bias parameter that allows the velocity dispersions of satellite galaxies in halos to 
be systematically higher or lower than those of dark matter. Large-scale distortions 
are determined by the parameter combination P = /bg, where bg is the bias factor 
defined by the ratio of galaxy and matter correlation functions, in agreement with 
the linear theory prediction of parameter degeneracy. However, linear theory does not 
accurately describe the distortions themselves on scales accessible to our simulations. 
We provide fitting formulas to estimate (3 from measurements of the redshift-space 
correlation function or power spectrum, and we show that these formulas are sig- 
nificantly more accurate than those in the existing literature. On small scales, the 
"finger-of-god" distortions at projected separations ~ 0.1 ft,~^Mpc depend on ^mol 
but are independent of crgj while at intermediate separations they depend on erg as 
well. One can thus use measurements of redshift-space distortions over a wide range 
of scales to separately determine fim, fg, and a„. 

Key words: cosmology: theory — galaxies: clustering — large-scale structure of 
universe 



1 INTRODUCTION 

In a universe that obeys the cosmological principle, the clus- 
tering of galaxies is statistically isotropic. But in galaxy red- 
shift surveys the distances to galaxies are inferred from red- 
shifts, making the line of sight a preferred direction. Peculiar 
velocities produce anisotropy in redshift-space clustering on 
all scales. On small scales, the random motions of galaxies in 
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virialized systems stretch groups and clusters into so-called 
"fingers-of-god" (FOG). On large scales, coherent flows cre- 
ated by gravity compress overdense regions along the line of 
sight and stretch underdense regions correspondingly. Small 
and large scale distortions provide diagnostics for the matter 
density parameter fim and the amplitude of mass fluctua- 
tions (Peebles 1976; Sargent & Turner 1977; Kaiser 1987). 
In this paper and its companion, we develop techniques for 
modeling redshift-space distortions that draw on recent de- 
velopments in the theory of galaxy clustering. These tech- 
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niques are designed to reach the level of accuracy demanded 
by the new generation of large galaxy redshift surveys, such 
as the Two-De gree Field Galaxy Redshift Survey (2dFGRS; 
Colless et alT2 0011 and the Sloan Digital Sky Survey (SDSS; 
York et alJl200Ql . 

In the linear theory model of Kaiser (1987), the rela- 
tion of the anisotropic, redshift-space galaxy power spec- 
trum Pz{k, n) to the isotropic, real-space galaxy power spec- 
trum PR{k) is 

Pz(fc,M) =Pfl(fc)(l+/3M')', (1) 

where jj, is the cosine of the angle between the wavevector 
k, and the line of sight. The amplitude of the distortion is 
determined by /3 = Q''^' /bun, where the linear bias parame- 
ter &iin = 5g/Sm is assumed to be independent of scale {5g 
and 6m represent galaxy and mass density contrasts, respec- 
tively) . Fourier transformation of equation Q gives expres- 
sions for the galaxy correlation function in redshift space, 
^{r„,r^) (Hamilton 1992). 

Unfortunately, non-linear effects make equation Q in- 
accurate on all scales where observations yield precise mea- 
surements (Cole, Fisher, & Weinberg 1994). The effects of 
non-linearity can be approximated by a phenomenological 
model in which galaxies have, in addition to linear theory 
distortions, random small scale velocities drawn from an ex- 
pone ntial distribution with dispersion av iPeacock fc Dodd3 
Il994l: Park et al. 1994; Cole et al. 1995). In this model, the 
Kaiser formula becomes 

Pz(fc,^) = Pfl(fc)(l + /3M')'(l + fc''^'MV2)"'. (2) 

In practice, most estimates of /3 from large-scale redshift- 
space distortions have utilized this linear-exponential 
model^, expressed in terms of the power spectrum as in 
equation Q or in terms of the correlation function or spher- 
ical harmonics. The current state-of-the-art measurement 
is the analysis of the 2dFGRS presented by Hawkins et al. 

(2003) , yielding /3 = 0.49 ± 0.09, updating the earlier 2dF- 
GRS analysis of Peacock et al. (2001). Previous observa- 
tional efforts and theoretical developments are expertly re- 
viewed by Strauss & WiUick (1995) and Hamilton (1998). 

The essential limitation of equation Q is that it is de- 
rived from an unphysical model. There are several sources 
of non-linearity in redshift-space distortions in addition to 
small scale dispersion (Cole et al. 1994; Fisher & Nusser 
1996), and the dispersion itself is correlated with the local 
density and is not a constant for all galaxies. Scoccimarro 

(2004) shows that the velocity distribution corresponding to 
the linear-exponential model is itself unphysical, containing 
a J-function and a discontinuity at the origin, and that equa- 
tion (2) does not become fully accurate even at very large 
scales. Hatton & Cole (1999) concluded that this model in- 
troduces a ~ 15% systematic error in the determination of 
/3, which is significant compared to the precision achievable 

^ There are several minor variants of this model that have also 
been utilized, such as replacing the exponential distribution with 
a Gaussian (Peacock & Dodds 1994) or specifying that the pair- 
wise distribution of galaxy peculiar velocities is exponential (e.g. 
Hatton & Cole 1999). 



with 2dFGRS and the SDSS. Furthermore, the cr„ parame- 
ter, while related to the amplitude of the small scale distor- 
tions, has no clearly defined physical meaning. In redshift- 
space distortion analyses it is purely a nuisance parameter, 
significantly degenerate with /3, and has no use in constrain- 
ing cosmological parameters. 

The program initiated by Kaiser (1987) largely sup- 
planted an earlier tradition of using small-scale redshift dis- 
tortions to constrain ilrn via the "cosmic virial theorem" 
(Peebles 1976, 1979; Davis, Geller, & Huchra 1978; Bean 
et al. 1983). The analytic expression of this "theorem" re- 
lied on the assumption of stable clustering, which early N- 
body simulation s showed was unl ikely to hold on the rele- 
vant scales (e.g.. lDavis et al]|l985ll . A more serious problem 
is that the bias between galaxy and dark matter clustering 
is likely to have a complex effect on quantities that enter 
the cosmic virial theorem, one that cannot be captured by 
a single bias parameter with an obvious physical interpreta- 
tion. 

The goal of this paper and its companion is to present 
techniques for physical modeling of redshift-space distor- 
tions that can take advantage of high-precision measure- 
ments on large and small scales. We construct these tech- 
niques in the framework of the Halo Occupation Distribution 
(HOD; see, e.g. Ma & Fry 2000; Peacock & Smith 2000; Sel- 
jak 2000; Benson 2001; Scoccimarro et al. 2001; Berlind & 
Weinberg 2002; Cooray & Sheth 2002), in which the bias 
of a specified class of galaxies is defined by the probabil- 
ity distribution P{N\AI) that a halo of mass M contains 
A'^ galaxies, together with prescriptions for spatial and ve- 
locity bias within individual halos. The HOD has proven 
to be a powerful tool for encapsulating the bias predictions 
of galaxy formation models (Kauffmann et al. 1997; Benson 
et al. 2000; White et al. 2001; Yoshikawa et al. 2002; Berlind 
et al. 2003; Kravtsov et al. 2004; Zheng et al. 2004), for an- 
alytic calculations of galaxy clustering statistics (see Cooray 
& Sheth 2002 and numerous references within), and for em- 
pirical modeling of galaxy clustering data (Jing et al. 1998; 
van den Bosch et al. 20 03; Zehavi et al. 2004a , b; Yang et al. 
2004; Mo et al. 2004; lAbazaiian et al.l l2004t iTinker et all 
[2004?). Several recent papers have presented calculations of 
redshift-space distortions or peculiar velocity statistics us- 
ing halo models of dark matter and galaxy clustering (Seljak 
2001; White 2001: ISheth et al.ll200ll:ISheth fc Diaferioll200ll : 
Kang et al. 2002; Cooray 2004), providing insight into the 
role of non-linear dynamics and non-linear bias in shaping 
clustering and anisotropy. However, these studies primar- 
ily focus on dark matter rather than galaxy clustering, and 
they have not yet yielded a clear blueprint for constraining 
cosmological parameters with HOD modeling of observed 
redshift-space distortions, which is our objective here. 

We use the HOD formulation to set up the redshift- 
space distortion problem in the following terms. Any red- 
shift survey large enough to yield useful measurements of 
large-scale anisotropy will first allow precise measurements 
of the projected correlation function, Wp{rp), which is unaf- 
fected by peculiar velocities. For any choice of cosmological 
parameters, one should choose HOD parameters to repro- 
duce this measurement of real-space clustering. If an ac- 
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ceptable fit cannot be found for the given cosmology, then 
the model is already ruled out (e.g. Abazajian et al. 2004). 
For models with acceptable real-space clustering, one cal- 
culates redshift-space distortions using numerical simula- 
tions or analytic approximations to test the model's cos- 
mological parameters. In practice, the parameters that en- 
ter are Sim and the amplitude of the linear theory matter 
power spectrum Piin(A;), which we characterize by erg, the 
rms linear matter fluctuation in 8 /i~^Mpc spheres (with 
h = -ffo/100 kms~^ Mpc~^). We assume that the shape of 
-Piin(fc) is known from measurements of the large scale galaxy 
power spectrum and cosmic microwave background (CMB) 
anisotropy, which together pin down th e parameters that de- 
termine Piin(fc) qui t e accurately (e . g.. iPercival et all 120021 : 
ISoergel et all l2003l : iTeemark etlo] l2004a^ . ' Since redshift- 
space anisotropy is insensitive to the shape of Pnik) — in 
equations Q and J^J the /i-dependence of Pz(fe) factors out 
entirely — small uncertainties in the shape of Piin(fc) should 
have minimal effect. In this work we adopt the power spec- 
trum form of Efstathiou, Bond, & White (1992), where the 
shape is parameterized by the characteristic wavenumber F. 

While matching Wp{rp) can constrain HOD parameters 
relevant to real-space clustering, we must also allow for the 
possibility that galaxies in a halo have a systematically dif- 
ferent velocity dispersion from that of the halo dark matter. 
(The mean velocity of galaxies and dark matter within a 
halo should be the same because both components feel the 
same large-scale gravitational field.) Numerical simulations 
predict that the galaxy closest to the halo center of mass 
moves at nearly the center of mass velocity while satellite 
galaxies ha ve a velocity dispersion similar to that of the dark 
matter (Ber lind et "311120031: IPaltenbacher et alJl2004^ . We 
define the satellite "velocity bias", a„, as the ratio between 
these two dispersions. Although the numerical simulations 
predict that a„ ~ 1, this parameter could depart modestly 
from unity as a result of dynamical friction, tidal disruption 
or mergers of slowly moving satellites, or different orbital 
anisotropy of galaxies and dark matter. We will treat q„ as 
a free parameter to be constrained by the observations, but 
we will assume that it is constant over the relevant range of 
halo masses. We will also consider effects of non-zero veloc- 
ities for central galaxies, though simulations predict these 
velocities to be < 20% of the virial velocity. 

In this paper we use N-body simulations to create halo 
populations for a set of cosmological models, and we popu- 
late those halos with galaxies using HOD models that yield 
similar real-space clustering. We examine the constraints 
that redshift-space distortions can impose within the three- 
dimensional parameter space (Sim, erg, a^), and we use our 
numerical results to obtain fitting formulas that can esti- 
mate parameters from observational data. In a companion 
paper, we develop a numerically calibrated analytic model 
for redshift-space distortions. The analytic model provides 
physical insight into the numerical results, and it can make 
more complete use of the observational measurements for 
cosmological parameter estimation. 

In Section 2 below, we describe the numerical simu- 
lations and the HOD models used to populate them with 
galaxies. Section 3 presents an overview of redshift-space 



anisotropics in the two-dimensional correlation function 
^{rajT-n)- In §4 we focus on measures of large-scale distor- 
tion based on multipole decomposition of the power spec- 
trum and the correlation function. These measures mainly 
constrain the parameter combination /3 = Q.^ /hg, which 
can be related to ag,Q.^ using the measured (real-space) 
galaxy clustering. (As discussed in §4.4, we define bg by a 
ratio of non-linear correlation functions, which makes it sim- 
ilar but not identical to the linear theory bias factor feun.) 
In §5 we turn to small scale distortions, which most directly 
constrain fimCiS and have some power to break degeneracies 
further and yield separate determinations of Q.m, o"g, and 
a„. In §6 we summarize our results and discuss how they 
can be applied to cosmological parameter estimation from 
observational data. 



2 NUMERICAL SIMULATIONS AND HOD 
MODELS 

2.1 N-body Simulations 

We use N-body simulations to create halo populations for 
a sequence of cosmological models, always assuming a spa- 
tially flat universe dominated by cold dark matter and a 
cosmological constant (ACDM), with Gaussian initial con- 
ditions and a primordial power spectrum motivated by ob- 
servations of CMB anisotropics and large-scale structure. 

We choose the mass resolution by requiring that there 
be at least 30 particles in the lowest mass halos that host 
simulated galaxies. On this basis we select a mean inter- 
particle separation of n~^^^ = 0.7 /i~^Mpc for all initial 
conditions. For Sim = 0.3, the 30-particle limit corresponds 
to a minimum halo mass of ~ 10^^ /i~^Mq, similar to the 
minimum halo mass found for the HOD flt (assuming Sim = 
0.3) to the SDSS sample of galaxies brighter than Mr — 
-20 + 5 log ft (Zehavi et al. 2004b). AU of our simulated 
galaxy populations have a space density of ftg — 5.6 x 1Q~^ 
(/i~^Mp c)~^, equal to that of SDSS galaxies brighter than 
0.68 L, llBlanton et al.ll2003^ . or = -20.04 + 5losh. 

To cover the (f2m,o"g) parameter space in an efficient 
manner, we draw on the findings of Zheng et al. (2002), who 
demonstrated that changes in Sim at fixed F and erg simply 
scale halo masses in proportion to Sim and halo velocities in 
proportion to Sim*". In terms of these scaled masses and ve- 
locities, the mass function, spatial correlations, and velocity 
correlations of halos identified at fixed overdensity are vir- 
tually independent of Sim. We can therefore run a single 
simulation that has a high value of as at redshift zero and 
use the earlier redshift outputs to represent z — results 
for lower values of erg. For each erg, the halo population can 
be scaled to any desired value of Sim. Specifically, we run 
simulations with Sim = 0.1 and erg = 0.95 at z=0, and use 
the outputs at z=0.19, 0.56, 0.97, and 1.45 when (Sim, erg) 
= (0.16, 0.90), (0.30, 0.80), (0.46, 0.70), and (0.62, 0.60). 
We model different values of Sim by scaling the halo masses 
in proportion to Sim, the halo velocities by Sim'', and the 
internal halo velocity dispersions by Slm^. We carry out a 
test of this scaling in §2.3 to demonstrate that it is accurate 
enough for our purposes here. 
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Table 1. Properties of the Simulations and HOD Parameters 



0.100 0.950 0.922 3.73x10" 9.38x10^"^ 0.934 

0.19 0.158 0.900 0.956 5.96x10^1 1.45x10^3 0.959 

0.56 0.297 0.801 1.041 1.09x10^2 2.51x10^3 1.005 

0.97 0.459 0.699 1.181 1.70x10^2 3.51x10^3 1.109 

1.45 0.620 0.599 1.358 2.19x10^2 3.99x10^3 1.199 



Note. — When we scale an output to a different value of Qm, the values of M^in and Mi scale in proportion to Qm, as discussed in 




-3 -2-10 1 

log (k [h/Mpc]) 



Figure 1. The two power spectra used in the simulations, 
r = 0.2 and P = 0.12, are compared to the linear power spec- 
trum computed with CMBFAST using the parameters listed in 
column 6 of Table 4 in Tegmark et al. (2004b). All three power 
spectra are normalized to the same value of erg. The fundamen- 
tal mode of the simulation volume is indicated with the arrow at 
k = 27r/Lbox = 0.025 fc/Mpc. The P = 0.2 power spectrum is 
similar to the CMBFAST calculation over the range of scales sim- 
ulated, while the P = 0.12 power spectrum has more power at 
large scales and increasingly less as k increases. 



We analyze simulations with two values of the power 
spectrum shape parameter, F = 0.2 and 0.12, both with in- 
flationary spectral index Us = 1. On the scales probed by 
our simulations, F = 0.2 corresponds well to the power spec- 
trum calculated with cmbfast {Scliak fc Zaldarriaga 1 99^1 
with Qm = 0.3, h = 0.7, and fib = 0.04, va lues favored by 
recent observations (e.g., Spergel et al. 2003: lTtegmark et alJ 
l2004d) . The redder, F = 0.12 power spectrum corresponds 
to a lower combination of flmh, or a tilted (n^ < 1) primor- 
dial spectrum. This model is at the extreme edge of those 
allowed by current data, so comparing results for F — 0.2 
and F = 0.12 should give a conservative estimate of un- 
certainties associated with the power spectrum shape. In 
Figure we compare these two power spectra to one cre- 



ated with the transfer function calculated by cmbfast us- 
ing the cosmological parameters listed in Table 4 (column 6) 
of Tegmark et al. (2004b), who derive combined constraints 
from WMAP CMB data, and the SDSS galaxy power spec- 
trum. Each power spectrum is normalized to the same value 
of ag. The fundamental mode of the box is marked with the 
arrow. Inside this scale, the F = 0.2 power spectrum closely 
tracks the cmbfast calculation. The F = 0.12 P{k) has less 
small-scale power, but it has significantly more power at 
scales near the fundamental mode. 

We use the publicly available tree-code gadget 
(Springel, Yoshida, & White 2000) to integrate the ini- 
tial conditions. We evolve 360^ particles in a volume 253 
on a side, giving us a mass resolution of 9.66 x 
10^" x Qrn h~^MQ per particle. The force softening was set 
to one-tenth the mean interparticle separation, or e = 70 
/i^^kpc. The simulations were started at an expansion factor 
a — 0.01, with a maximum timestep of 0.005 in a. gadget 
employs individual particle timesteps governed by a parti- 
cle's acceleration, such that Aa oc yfer\. The value of x] was 
set to 0.2. We ran five independent realizations to estimate 
the sample variance. 

We also ran a similar series of simulations using the 
particle-mesh (PM) te chnique , with a st aggered- mesh algo- 
rithm similar to that of lMelotd lll983l) and lParkUl990h . (The 
code we use was written by V. Narayanan.) The high effi- 
ciency of the PM algorithm allowed us to run simulations 
with the same mass resolution but box sizes of 324 /i~^Mpc 
per side, twice the volume of our gadget runs. In com- 
paring the results from the two methods, we found that the 
lower force resolution of the PM technique (with a 900^ grid) 
had a significant impact on the number of halos near our 
30-particle resolution limit, while the smaller volume of the 
GADGET runs did not adversely affect the distortions at large 
scales. We therefore use the gadget runs exclusively in our 
subsequent analyses. 



2.2 HOD Models 

To identify halos in the dark matter distribution we use 
the friends-of-friends algorithm (Davis et al. 1985) with a 
linking length of 0.2 times the mean interparticle separation. 
Objects identified with this linking length typically have an 
average density of p/p ~ 200, which is roughly the criterion 
for virialization of a collapsed object. Only halos with 30 or 
more particles were retained in the halo sample. 
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Table 2. Properties of the Mock Galaxy Distributions 
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Note. — In the first three sequences, a„ = 1 and a^c = 0. The HOD parameters and bias factors bg for each value of erg are listed in 

Table 1. 



We need to populate the halos with galaxies in a way 
that generates similar ^R{r) for all values of as- We use 
the HOD parameterization of Kravtsov et al. (2004) and 
Zheng et al. (2004) , which was also adopted in the empirical 
modeling of the SDSS correlation function by Zehavi et al. 
(2004b). Halos above a minimum mass Mmin are assigned 
one central galaxy. The mean number of satellite galaxies in 
halos with Af ^ Mmin is 



lAT \ I 
(A/sat )Af = ( 



(3) 



The mean number of galaxies in a halo is therefore (A^) m ~ 
1 + (M/Mi)" for M ^ Mmin and {N)m = for M < 
Mmin- We assume Poisson scatter in the number of satel- 
lite galaxies with respect to the m ean (A^sat)M, co n sisten t 
with the theoretical predictions of iKravtsov et aTl i2004l) . 
and Zheng ct al. ( 2004). 

We adopt the parameter combination {Q,m.,o-s) = 
(0.3, 0.8) for our central model. To populate the halos in this 
model, we choose observationally motivated HOD parame- 
ters similar to those derived for t he SDSS Adr < —204-5 log h 
galaxy sample bv Ze havi et alJ (i2004bj. The resulting cor- 
relation function is shown by the solid line in Figure |5| For 
other as values, we choose Mi, a, and Mmin so that we 
closely match fj?(f) of the central model, while maintain- 
ing a fixed galaxy space density. We carry out the HOD 
pa rameter fits u sing the analytic model of ^R{r) described 
bylTinkere^^J (|2004), which refines the model described 
bv lzhen^T200^ . The cosmological and HOD parameters of 
our simulations are listed in Table 1. 

We assume that satellite galaxies trace the dark mat- 
ter distribution within halos; a test in §3 below shows that 
our results are insensitive to this assumption (see Figure |7||. 
Instead of selecting random dark matter particles from the 
friends-of-friends halos, we randomly place satellite galaxies 
in each halo following the universal halo profile of Navarro, 
Frenk, & White (1997; hereafter NFW). This technique 
makes our results insensitive to numerical force resolution 
or to discreteness effects on halo structure and velocity dis- 
persions. It also allows for easier comparison to analytic ap- 
proximations, since the N-body halo population is better 
controlled and characterized. Most importantly for our pur- 
poses, it allows us to choose halo concentrations appropri- 
ate to each combination of erg and ilm, using the methods 




100 



r [h"' Mpc] 



Figure 2. The real-space galaxy two-point correlation functions 
for the five cosmologies and HOD parameters listed in Table 1. 
The inset box shows the different correlation functions normal- 
ized by that of the central model, Qrn = 0.3, as = 0.8. The error 
bars in the inset box are those for the central model. Results are 
averaged over five realizations, and error bars show the run-to- 
run dispersion divided by \/N — 1 = 2 to calculate the error in 
the mean. In both panels, the dash-dotted line is the correlation 
function for the central model's cosmological and HOD parame- 
ters but the r = 0.12 initial power spectrum. 



of lBullock et alJ ll200j) and lKuhlen et alJ ^l2004^.^ T h e sim - 
ple scaling of halo properties found bv IZheng et alJ ll2002h 

does not extend to internal structure, which depends sys- 



^ We use the Bullock et al. (2001) method of calculating Cvir = 
Rvir/Ts, where Rvir is the virial radius of the halo and Ts is the 
NFW scale radius. The virial overdensity used bv lBuUock et all 
J2OOII) depends on Qrn and can be significantly different from 
the 200 assumed here. To correct for this, we calculate c^ir for 
a given halo mass M^ir, then calculate the corresponding M200 
(since the halo mass depends on the defined edge of the halo) and 
scale the concentration by ii20o/^vir- See Hu & Kravtsov (2002) 
for details. 
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tematically on flm- When creating galaxy populations for 
models with different f2,„ but the same ug, we change halo 
concentrations appropriately but keep the HOD parameters 
fixed. This procedure leads to small differences in ^R{r) from 
model to model, but these have negligible impact on our 
redshift-space distortion results. We discuss concentration 
effects at the end of §3. 

We draw line-of-sight velocities of satellite galaxies (rel- 
ative to the halo center-of-mass) from a Gaussian distribu- 
tion with dispersion 

/ GM 

where R200 is the radius at which the mean density of the 
halo is 200 times the background density. For — 1, this 
choice corresponds to the velocity distribution of an isother- 
mal sphere. Although a literal interpretation of a„ 7^ 1 is 
that the satellite population is "colder" or "hotter" than the 
dark matter particles, a modest departure from unity can 
also account for orbital anisotropy and non-isothermality. 
In tests of anisotropy we find that a model with one- 
dimensional velocity dispersions such that cr^/cr^ — 2 and 
/al = 0.5, where i, j, and k are orthogonal directions ran- 
domly oriented with respect to the axes of the box, produces 
quantitatively similar results to a model with q„ = 0.8. 

We use a similar technique for the velocities of central 
galaxies, but here our standard assumption is that the ve- 
locity bias parameter a^c ~ 0. We also consider a model in 
which the central galaxies have modest velocities, a^c ~ 0.2, 
and an extreme model with a^c ~ 1- We also consider mod- 
els with satellite a„ = to isolate the physical effects of 
the virial dispersion from those of the halo velocities. The 
av — models are also relevant to observational analy- 
ses that employ "FOG compression", i.e., identification and 
compression of galax y groups in redshift space (see, e.g., 
iTegmark et al.ll2004a^ . If this technique works perfectly, it 
effectively sets a„ = in all halos. 

Figure 121 shows real-space galaxy correlation functions 
for r = 0.2 and erg = 0.6,0.7,0.8,0.9, and 0.95 (see Table 
1 for exact values). Results are averaged over five realiza- 
tions, and error bars show the run-to-run dispersion divided 
by \''N — 1 = 2 to yield the error in the mean. The inset 
box shows the deviation of (,R{r) for each model relative to 
that of the central (fim = 0.3, as — 0.8) model. The models 
with as 0.7 match the central model to < 5% at r < 20 
h~^Mpc. At larger scales, finite box effects make the devia- 
tions larger than 10%, but these are smaller than the statis- 
tical errors. The as ~ 0.6 model matches the central model 
to 5% or better at most r, but it deviates by ~ 15% around 
0.8 /i~^Mpc. At roughly this scale there is a transition be- 
tween one-halo and two-halo galaxy pairs, and the effects of 
as on the halo mass function are difficult to overcome with 
P{N\M) changes. 

The dot-dash curve in Figure |21 shows ^R{r) for the 
fim = 0.3, as = 0.8, r = 0.12 model. With this large change 
in the shape of the matter power spectrum, it is impossible 
to choose HOD parameters that make the galaxy correlation 
funct ion match that of the F = 0.2 models, or the SDSS 
data iAbazaiian et al ] |2004) . Instead, for this set of models 



we use the same HOD parameters found for the correspond- 
ing as value in the F — 0.2 runs. The spread among ^R{r) 
for the five F = 0.12 models is comparable to that for the 
F = 0.2 models. At r < 2 ^~^Mpc, however, the spread is 
approximately twice as large. 



2.3 Velocity Scaling 

Figurel^tests the efficacy of the mass/velocity scaling tech- 
nique described in §2.1. For this test, we ran two new sets 
of GADGET runs, each set comprised of five simulations with 
200"^ particles in a 200 h~^Mpc box. One set has (fim, erg) = 
(0.1,0.8) at z = 0, the other has (nm,crg) = (0.4,0.8) at 
2 = 0. In both cases we chose HOD parameters Mmin and 
Ml corresponding to 30 and 600 particles, respectively, with 
a = 1. 

Panel (a) in Figure |3 shows contours of the redshift 
space correlation function, £,{rcr, r^), where represents the 
projected separation between two galaxies and r-n the line-of- 
sight separation. This way of representing the data is widely 
used in observational studies, such as Peacock et al. (2001) 
and Hawkins et al. (2003). We use the distant observer ap- 
proximation, so simply becomes the redshift distance be- 
tween galaxy pairs along one dimension of the box, account- 
ing for the periodic boundary condition. Here correlation 
functions are averaged over three projections of five realiza- 
tions for a total of fifteen measurements. The higher density, 
f^m = 0.4 model shows stronger compression of contours at 
large scales because of larger coherent flows, and it shows 
stronger FOG distortions at small scales because of larger 
dispersions between and within halos. 

Panel (c) presents the same data in a different fashion. 
Each line in the panel represents the value of ^(rcr,»"^) as a 
function of at a given r^, a slice in the — plane. At 
= 0.12 /i^^Mpc, the Qrn ~ 0.4 model starts at a lower 
value of ^(rCT,r^) but remains horizontal for a longer range 
of r^. The extended horizontal plateau reflects the longer 
FOGs in the higher density model, and since the pairs at 
small ra are spread over a larger range of r,r, the amplitude 
near = is necessarily depressed. We will use the turnover 
of ^(rTr) at small as a quantitative measure of small-scale 
distortions in §5. At the bottom of panel (c), where the lines 
represent ra = 11 /i~^Mpc, the Q,rn = 0.4 line is above the 
f2m =0.1 line because of the large amplification of clustering 
in the coherent infall regime. 

In the right panels, (b) and (d), we have scaled the 
velocities of the halo and galaxy populations of both mod- 
els to ^Im ~ 0.25 in the manner described in §2.1, keeping 
HOD parameters fixed in particle number (and thus scaled in 
mass proportional to fim). In both manners of representing 
the data, the correlation functions are nearly indistinguish- 
able. In other words, we can scale an Qrn = 0.1 model to 
Qm = 0.25 and Qm = 0.4 model to Qrn ~ 0.25 and get the 
same result. Figure [^demonstrates that our velocity scaling 
technique can be applied to our simulations without accru- 
ing systematic errors at either large or small scales. 
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Figure 3. Test of the Om-scaling procedure. For this test, we use simulations with 200'' particles in a 200 h-^Mpc box and HOD 
parameters a = I and Mmin and Mi corresponding to 30 and 600 particles, respectively, (a) The correlation function in redshift space, 
^(''o-i ''•n-)i for f^m = 0.1 (dotted) and Qrn = 0.4 (solid). The contours represent lines of constant correlation separated by factors of 
two, with the outermost contour representing 2~*. (b) 5(ro-,r,r) for the same models, but now the galaxy velocities have been scaled to 
= 0.25. (c) Same models as (a), but now the different lines represent slices in the — r.^ plane at different To- : 0.12, 0.3, 1, 3, and 
11 /i~^Mpc from top to bottom, (d) The results from the two simulation sets, scaled to Qrn = 0.25, are plotted for the same slices in 
the rcr — Tr: plane. Results in all panels are averaged over three projections of five realizations, and error bars in (c) and (d) show the 
run-to-run dispersion divided by \/N — 1 = \/T4 to calculate the error in the mean. Errors are only plotted for the solid lines to avoid 
crowding. 



3 OVERVIEW OF C (ra , r ^ ) 

Figure 0] encapsulates the dependence of the redshift- 
space correlation function, ^(r^-jr.^), on position in the 
(f2m,(jg,a„) parameter space. Each panel shows contours 
of 5(rCT,r^), separated by factors of two, for a sequence of 
models in which two parameters or parameter combinations 
are held fixed and one is allowed to vary. Recall that these 
variations in cosmological parameters or velocity bias are 
carried out at fixed (or nearly fixed) real-space galaxy clus- 
tering, as shown in Figure |5| The green contours in each 
panel of Figure 2] show the central model with fim = 0.3, 
(Tg — 0.8, Q„ — 1.0, Q„c = 0, and all models have F = 0.2. 

In panel (a), blue and red contours show models with 
erg = 0.6 and 0.95, respectively, still with f2m = 0.3 and 
a„ = 1. As (Tg increases, ^(ro-,r^) contours become more 



flattened because the amplitude of coherent flows increases 
with larger dark matter fluctuations. In terms of equation 
Q, higher ag means a lower galaxy bias factor for fixed 
galaxy clustering amplitude, and thus a higher value oi (3 = 
n°J/bg. In the FOG regime at small r^, contours of the 
three models are nearly degenerate at r.^ ^ 10 /i~^Mpc. 
At these scales, most galaxy pairs are common members of 
intermediate mass halos, and the FOG distortion depends 
on the masses of those halos. The halo mass function is only 
weakly dependent on erg at these intermediate masses, so the 
contours converge. However, a high-erg model has more high 
mass halos with large virial velocity dispersions, so at large 
the contours extend further for higher ag. 

Figure shows a model sequence in which erg = 0.8, 
Oi, — 1, and Sim = 0.1 (red), 0.3 (green), and 0.5 (blue). The 
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[h"' Mpc] [h"' Mpc] 



Figure 4. Contour plots of ^{rcr,rn) in the ra — r,r plane, for the four sets of simulations listed in Table 2. The contours are separated by 
factors of two in ^(rCT,r,r), with the outermost contours representing 5(ro-,r,r)= 2"*. (a) Models with f2m = 0.3, Oy = 1, and erg = 0.95 
(red), eg = 0.80 [green), erg = 0.60 (blue), (b) Models with trg = 0.8, a„ = 1, and Q^n = 0.1 (red), Q^n. = 0.3 (green), Q^n = 0.5 (blue), (c) 
Models with /3 = n^'^/bg = 0.46, a„ = 1, and trg = 0.95 (red), erg = 0.80 (green), as = 0.60 (blue), (d) Models with = 0.3, trg = 0.8, 
and a„ = (red), «„ = 0.8 (green), o„ = 1.2 (blue). 



flattening of contours at large and elongation at small 
To- both increase with Qm, since a higher density universe 
has larger amplitude coherent flows and more massive ha- 
los. While the large scale distortions have a similar qualita- 
tive dependence on ag and Qm, the FOG distortions show 
an important difference. Changing Qm shifts the halo mass 
function coherently at all masses, but changing ag shifts the 
high and low ends of the mass function in opposite direc- 
tions, with little change at intermediate masses. As a result, 
the FOG contours converge for the varying as sequence in 
panel (a) but not for the varying Qm sequence in panel (b). 

In panel (c), we again vary erg from 0.6 to 0.8 to 0.95, 
but for each value of erg we choose the value of Qrn that 
keeps the combination /3 — Q^;^ /hg constant. Note that hg 
is approximately proportional to cr^^, so this sequence has 
approximately constant as^'i^ , but not exactly (see §4.4 for 
further discussion). Here the contours overlap almost per- 
fectly on large and intermediate scales, and they are simi- 
lar even in the FOG regime. While linear theory does not 



predict the form of £,{ra, Ttt) accurately even on the largest 
scale shown (see Figure |S| below), it correctly predicts that 
the class of models with constant /3 is nearly degenerate 
with respect to redshift-space distortions. The differences in 
the FOG regime, though difficult to see on this Figure, will 
nonetheless prove sufficient to distinguish models with the 
same (3 but different erg. 

In panel (d) we explore the effect of velocity bias. This 
sequence uses the central values of = 0.3 and erg = 0.8, 
(and thus has constant /3), with equal to 0, 0.8, and 1.2. 
For clarity, we omit the = 1 model from the plot. The 
ai, = model, which would represent measurements from 
a data set with perfect "FOG compression," has elliptical 
contours at all scales, with no trace of the elongation at 
small Ta- Since velocity bias is applied only within halos, 
these contours show that FOG distortions in ^(rcr,r,r) arise 
entirely from halo internal velocity dispersions. At larger 
scales, the = model begins to coincide with the others 
when Ta > r'-TT- The models with a„ = 0.8 and 1.2 diverge at 
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Figure 5. Redshift-space correlation functions for the same models as in Figure|l] now plotted in the slice format, which allows easier 
identification of the small-scale distortion. In each panel, curves represent r^r = 0.12, 0.46, 2.54, and 14.2 /i~^Mpc from top to bottom, 
(a) Model with = 0.3, a„ = 1, and ag = 0.95 (dotted), erg = 0.80 (solid), erg = 0.60 (dashed), (b) Models with ag = 0.8, = 1, and 
= 0.1 (dotted), n„i = 0.3 (solid), n„i = 0.5 (dashed), (c) Models with /3 = /bg = 0.46, a„ = 1, and erg = 0.95 (dotted), ag = 0.80 
(solid), and erg = 0.60 (dashed), (d) Models with Q^n = 0.3, erg = 0.8, and a„ = (dotted), = 0.8 (solid), a„ = 1.2 (dashed). Errors 
are the run-to-run dispersion divided by \/l4 to calculate the error in the mean. Error bars are only plotted for the solid lines to avoid 
crowding. 



approximately the same location, with higher a„ resulting 
in a stronger FOG effect. The small scale dispersion affects 
any global measure of the shape of ^(ro-,r.^) contours, such 
as quadrupole-to-monopole ratios, but it has only a small 
effect at large Vcr and r.^. We have also created two models, 
not shown in this figure, with no satellite velocity bias but 
with Que = 0.2 and 1. These models will be discussed in 
subsequent sections. 

For the remainder of the paper, we will refer to these 
four model sequences by writing the parameters that are 
held constant in square brackets. Panel (a) plots the [$7^,011] 
sequence, panel (b) plots the [(J8,q„] sequence, panel (c) 
plots the [/3,Q!u] sequence, and panel (d) plots the [fimjCrg] 
sequence. The values of ^Im, crs, /9, Ov, and a^c for these 
four model sequences are listed in Table 2. 

Figure |^ plots the same results as Figure 0] but now 
in the form of Figure |21 showing slices at fixed values of 



rcr- For each model, the top two curves trace out the FOG 
distortions at « 0.12 ft~^Mpc and = 0.46 /i"^Mpc, 
allowing discrimination of models in the FOG regime that 
is difficult from the contour plots alone. 

In panel (a), changes in ag at fixed [57m, Qd] have only 
a small effect on the FOG distortions at = 0.12 /i^^Mpc, 
though even these changes are significant relative to our sta- 
tistical error bars, which are comparable to the line width. 
At To- — 14 /i~^Mpc, the high-as model has higher ^(ro-, r^) 
at all Vtt, but the large scale distortions are more difficult to 
discriminate in this representation compared to the contour 
plot (Fig.Elj,). 

In the remaining panels, parameter changes have a 
marked effect on the FOG distortions at small r^. In par- 
ticular, the models with constant [/3, a„], which have nearly 
identical large scale distortions, show a ~ 40% change in 
C('"o-,f'7r) at small (rCT,r^) as ag rises from 0.6 to 0.95 (Fig. 
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Figure 6. Numerical results for the central model (Hm = 0.3, erg = 0.8, = 1) compared to the best fit linear-exponential model. The 
solid lines are the numerical results and the dotted lines are the model. To fit the data, the exact value of /3 was used, and the best fit 
dispersion trg = 418 kms~^ was found by minimization. Contour levels and r„ values are the same as those used in Figures El and ISl 
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Figure 7. The influence of HOD parameterization or halo concentration on predicted redshift-space distortions, when HOD parameters 
are chosen to yield the same real-space galaxy correlation function. All models assume Qrn = 0.3, erg = 0.8, Qy = 1. The top row compares 
results from our standard three-parameter HOD to the five- parameter HOD of Zheng et al. (2004). (a) Mean occupation functions {N) ]\,{ 
for the three-parameter HOD (solid line) and the five-parameter HOD (dotted line). The dashed curve is the satellite contribution to 
(Af)jVf for the five-parameter HOD. (b) ^{rcr,r.^) contour plots for the three-parameter HOD (solid line) and the five-parameter HOD 
(dotted line, virtually indistinguishable), (c) Slices in ro- — r,r for the three-parameter HOD (solid lines) and five-parameter HOD (dotted 
lines), (d) — (f): Similar to (a) — (c), but dotted curves represent a model in which concentrations have been reduced by 30% and the 
HOD parameters (in the three-parameter model) have been adjusted to maintain the small-scale correlation function. 



\^)- While the separation of hnes is not dramatic on a plot 
spanning five decades on the i/-axis, differences of tens of 
percent should be easily measurable at these scales in the 
samples the size of the 2dFGRS and SDSS. Changing a„ 
from 0.8 to 1.2 has an effect of similar magnitude, though it 
differs in detailed form (Fig. |3|i) . 

Figure |S] compares our numerical results for the cen- 
tral model (fim = 0.3, erg = 0.8, a^j = 1) to the analytic, 
linear-exponential model of equation (|5J. We fix P to the 



true value of 0.46 and vary cr„ to minimize for all data at 
separations larger than 10 h~^Mpc (we get similar cr„ if we 
use data at all separations). The linear-exponential model 
describes the large scale distortions fairly well, though even 
here there are systematic differences between the numeri- 
cal ^(ro-,r^) contours and the model fit. The model does 
a poor job of replicating the FOG distortions at large r^, 
a failure that is evident in both the contour plots and the 
line plots. These deficiencies of the linear-exponential model 
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can also be seen in its application to the 2dFGRS data by 
Peacock et al. (2001, see their Figure 2). There, the mea- 
sured distortions at small ro- clearly extend past the model 
predictions, even though the FOG effect has been smoothed 
relative to our plots here by the larger bin size. We can force 
the linear-exponential model to better match the FOG dis- 
tortions by adopting a higher a^, but the fit at large scales 
is then severely degraded. 

When analyzing observational data, we must infer the 
galaxy HOD by fitting parameterized models to the mea- 
sured real-space clustering (e.g., the projected correlation 
function). We anticipate that redshift-space distortions will 
be insensitive to the adopted HOD parametrization so long 
as the model reproduces the observed real space correlation 
function. Figure demonstrates the validity of this conjec- 
ture. We first populate the halos of our as = 0.8, Qm = 0.3 
N-body simulations using a five-param eter HOD model fi t 
to results of a hydrodynamic simulation iZheng et al .|2004) . 
in which the galaxy space density is 2.5 x 10~^ (/i~^Mpc)~^. 
This parameterization incorporates adjustable smooth cut- 
offs in the central and satellite galaxy mean occupation func- 
tions, and it can achieve an essentially perfect fit to the 
predictions of semi-analytic and numerical models of galaxy 
formation fiZheng et aL .20041 . We then fit parameters of 
our restricted, three-parameter HOD model to reproduce the 
correlation function of the five-parameter model as closely 
as possible, obtaining agreement similar to that in Figure 
|5| Figure shows the original and fitted mean occupa- 
tion functions, and Figures 03 andQ: show ^(ro-,r^) for the 
two models, in the format of Figures |1| and |K| respectively. 
While the sharp cutoff model cannot represent the {N}m 
of the input model exactly, it predicts essentially indistin- 
guishable redshift-space distortions. The large scale distor- 
tions for both models are weaker than those in Figures|l|and 
13 because our HOD parameters are matched to a strongly 
clustered galaxy sample with higher bg and consequently 
lower /3. 

As discussed in §2.2, our HOD models assume that 
satellite galaxies in halos have the same radial profile as the 
dark matter. If we change this assumption when fitting the 
observed correlation function, or if we make this assumption 
but it does not hold in the real universe, then we will de- 
rive slightly different HOD parameters, which in turn will 
change the redshift-space distortions. We test our sensitiv- 
ity to the radial profile assumption by creating a model that 
matches ^R{r) of our standard central model but uses satel- 
lite profile concentrations 30% lower than those of the dark 
matter halos themselves. Figure Ql shows the mean occu- 
pation functions of the two models. The low concentration 
model has a lower Mi to create more close one-halo pairs, 
and a lower a to prevent overpopulation of massive halos. 
Figures and |7f show the redshift-space distortions of the 
two models. The large scale distortions of the two models 
are the same, apparent from both the contour plots and the 
line plots. The low concentration model has slightly weaker 
fingers-of-god because it has fewer galaxies in massive halos, 
but this difference is barely distinguishable in Figure|7f, and 
the difference in the quantitative measures of small scale dis- 
tortion measures introduced in §5 is within our statistical er- 



rors. We conclude that departures from the standard radial 
profile by 5C 30% do not alter our results. Still larger changes 
might have noticeable effect, since the inferred HODs would 
predict different non-linear velocity fields, but substantial 
departures from theoretically predicted dark matter pro- 
files can be detected observationally by measuring satellite 
galaxy profiles in groups and clusters. 



4 MEASURES OF LARGE-SCALE 

DISTORTION AND THE VALUE OF /3 

The blueprint for cosmological parameter estimation begins 
at large scales. At these scales, anisotropies are governed 
by the value of /3 = fiJii^/feg (see Figure gj- The effects 
of velocity bias are limited and, we will show, straightfor- 
ward to remove. Values of bg for our five values of as are 
listed in Table 1. We define galaxy bias factors by the ratio 
of the non-linear, real-space galaxy and matter correlation 
functions in the range 4 r ^ 12 /i~^Mpc, b^ = ^g/^m, 
a choice that we discuss further in §4.4 below. Changing 
the range to 10 ^ r ^ 25 h~^Mpc changes the values by 
< 1%. In characterizing distortions of the power spectrum 
or correlation function, we follow the track of Kaiser (1987), 
Hamilton (1992), and Cole et al. (1994), using either the 
ratio of the angle-averaged redshift-space quantity to the 
real-space quantity, or the ratio of the quadrupole moment 
to the monopole in redshift space. The two methods applied 
to two statistics provide four measures of large scale distor- 
tions, illustrated by Figures ISt ll 1 1 below. 

4.1 The Power Spectrum 

The angular dependence of the redshift-space galaxy power 
spectrum can be characterized as a sum of Legendre poly- 
nomials, denoted here as Li{fi), 

oc 

This equation can be inverted to determine each individual 
multipole by 

m)-^-j Pz{k,^,)Li{p.)d^,. (6) 

Statistical symmetry of positive and negative peculiar veloc- 
ities guarantees that odd multipoles vanish on average. In 
linear perturbation theory, only the Z = 0, 2, and 4 moments 
are non-zero. Equations (0 and Q yield 

J'o(fc) = (l + |/3+i/3')p«(fc), (7) 

J'2(fc) = (|/3+^/3')Pfl(fc), (8) 

for the monopole and the quadrupole, where Pnik) is the 
real-space power spectrum. In linear theory, the angle- 
averaged redshift-space power spectrum Po{k) is amplified 
over the real-space power spectrum by a constant factor, 
and the enhancement of fiuctuations along the line of sight 
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produces a positive quadrupole P2{k) with the same shape 
as PR{k). The ratio of the monopole to the real-space power 
spectrum, Pq/r, or the quadrupole-to- monopole ratio, P2/O1 
are scale-independent functions of /3: 



Po/R{k) 



P2/o(k) 



PR{k) 3 5 



P2{k) 
Po(fc) 



(9) 



(10) 



However, non-linear effects, especially the velocity disper- 
sions in collapsed or collapsing structures, suppress Po(k) at 
smallest scales and cause the quadrupole to actually reverse 
sign in the non-linear regime. In practice, the ratios Pq/h 
and P2/0 a-re monotonically decreasing functions of k, and 
equations Q and llOII do not provide accurate estimates of (5 
at scales accessible to high-precision measurements. The use 
of the linear-exponential model (eq.|5J in place of pure linear 
theory (eq. can greatly improve the accurac y of [3 esti- 
mates, but it still does not r emove biases entirely llCole et alJ 
ll995l:lHatton fc Colelll99i) . 

To calculate the redshift-space galaxy power spectra 
for our simulations, we use the same technique as Berlind, 
Narayanan, & Weinberg (2001). In the distant observer ap- 
proximation, we take an axis of the box as the line of sight, 
wrap particles around the periodic boundary if their peculiar 
velocities shift them outside the box, and calculate Pz{k,^) 
by Fast Fourier Transform. We use a 200^ density mesh and 
treat each axis as an independent line of sight. The multipole 
moments are calculated by fitting the first three even terms 
in equation (|SJ. We compute the average from 15 measure- 
ments (three projections of five simulations) and the errors 
by dividing the run-to-run dispersions by -\/T4. Figures |H] 
andl^show the results of this analysis for Pq/r and P2/O1 re- 
spectively, as functions of wavelength X = 2n/k. Horizontal 
dotted lines represent the values of Pq/r and P2/0 predicted 
by linear theory (eas. l^and llOH . 

FigureslHK andlHb plot Po/R{k) for varying erg and Qm, 
respectively. At large A, Pq/r increases with increasing /3. 
But all the curves drop rapidly at scales A ~ 30 h~^Mpc 
due to non-linearities. The difficulty in using linear theory 
to extract (3 is easily seen; none of the models shows a clear 
asymptotic value of Pq/r- An estimate of the linear theory 
value might be possible for the lowest value of ft m or (7 S , 
but as either parameter increases the slope of the curve at 
large A becomes larger. At /3 > 0.4 the data never converge 
to the large-scale horizontal asymptote predicted by linear 
theory, even at the fundamental mode of the box. 

For constant [/3, q„], in panel (c), the curves are nearly 
identical within the error bars, especially at large scales. 
Thus, even though linear theory does not yield an accurate 
estimate of (5, it predicts the scaling of Pq/r with cosmo- 
logical parameters almost perfectly, quantifying the visual 
impression of Figure |1J;. In panel (d), the behavior of the 
a„ = model demonstrates that random dispersion in viri- 
alized groups plays a dominant role on suppressing Pq/r. 
With the virial motions eliminated, the data for this model 
remain nearly constant over more than a decade in A, with 
the other curves only meeting it at A ~ 100 /i~^Mpc. A suffi- 



ciently effective FOG compression technique might therefore 
allow useful estimation of /3 from linear theory and Pq/r- 

The other velocity bias models begin to diverge from 
each other at A ~ 70 /i~^Mpc, again demonstrating that 
cluster virial velocities affect redshift distortions well into 
what is normally considered the linear regime. If we allow 
central galaxies to move with respect to the halo center-of- 
mass with bias a^c ~ 0.2, we find barely detectable changes 
(the line cannot be seen because it is directly beneath the 
line for the central model). We also plot the model with 
Uvc = 1, in which the central galaxy random velocities 
are the same magnitude as those of dark matter particles. 
At small scales, adding large central galaxy velocities has 
roughly the same effect as increasing the satellite velocity 
bias to Qi, — 1.2, but the a^c = 1 model converges with the 
central model somewhat faster. 

Results for the quadrupole-to-monopole ratio are shown 
in Figure |3] The model dependence of P2/0 is qualitatively 
similar to that of Pq/K) though the use of a higher order mul- 
tipole leads to substantially larger statistical error. As with 
Pq/r, the P2/0 curves only reach a large scale asymptote for 
the lowest values of /3. Once again, however, linear theory 
correctly predicts that models with constant /3 have the same 
large scale distortions. For the fixed [Om,o"8] model set in 
panel (d), the — model is consistent with linear theory 
at A > 20 h~^Mpc. Increasing satellite velocity dispersions 
suppresses P2/0 at steadily larger scales. Central galaxy ve- 
locities with Ovc = 0.2 produce almost no change, while the 
model with a^c = 1 shows even stronger suppression than 
the satellite q„ = 1.2 model. 



4.2 The Correlation Function 

Since the power spectrum and correlation function are re- 
lated by Fourier transformation, the linear theory approxi- 
mation to Pz(k, jj.) also applies to ^(r^, r^). Hamilton (1992) 
introduced the multipole approximation in configuration 
space, devising linear theory diagnostics of (,{ra,r-n) that 
parallel those in equations JUJ and 1101 . The multipoles of 
the redshift space correlation function, £,i{r), are calculated 
by the same inversion formula used in the Fourier domain. 



6(0 = 



2/ + 1 



(11) 



where r — Vr'i + r'i- and fi — r,r/r. The ratio of the 
monopole, ^oir), to the real-space correlation function, 
^R{r), exactly parallels equation @, 



The quantity 



Qdr 



6(0 



eo(r)-Co(r) 1 + 1/3+1/32' 



(12) 



(13) 



has the same asymptotic value as P2/0 in linear theory (as- 
sumed for the second equality above). Here ^oir) is the 
spherically averaged monopole. 
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(14) 



We henceforth refer to as the quadrupole of the redshift- 
space correlation function. To calculate ^oir) and ^2{r), we 
bin galaxy pairs on a polar grid of logarithmic spacing in r 
and linear spacing in angle, then perform the integral lllll 
numerically at each r. 

Figure ITUl shows the results for ^o/flj plotted as a linear 
function of r. In each panel, the curves reach an asymp- 
totic value quickly, near r = 10 h~^Mpc. In most cases, the 
asymptote is above the dotted line representing the linear 
theory prediction. Despite this small systematic bias, which 
increases with increasing /?, this diagnostic does not suffer 
from non- linear suppression of distortions at large scales; a 
fit to a constant value is straightforward. Another notable 
advantage of this diagnostic is that the effects of velocity 
bias (panel d) are almost negligible beyond r — W /i~^Mpc. 
FOG compression (a„ = 0) removes the systematic offset 
between ^o/r and the linear theory prediction at r ~ 10 — 30 
h~^Mpc. This result suggests that the offset is a consequence 
of FOGs transferring pairs from small separations in real 
space to large separations in redshift space. 

Figure irTl plots as a linear function of r. These curves 
resemble those of the power spectrum measures plotted as 
a function of log A. Models with low values of (3 reach a 
horizontal asymptote at large r, while for the high-/? 
models is still increasing at the largest separation. All the 
curves are under the predicted linear theory values, in con- 
trast to the results for ^o/r- Figure ITTl i shows that small 
scale dispersions are the main effect suppressing Qj; with 

= 0, tracks the linear theory prediction down to 
r = 10/i~^Mpc. Increasing satellite or central galaxy ve- 
locity dispersions drives the non-linear suppression of to 
larger scales. 



fitting functions are arbitrary, motivated by efficacy rather 
than theoretical arguments, but they all encode the general 
behavior of linear distortions at large scales suppressed or 
reversed by non-linear effects at small scales. 

For the quadrupole-to-monopole ratio of the power 
spectrum, our procedure is similar to that proposed by Hat- 
ton & Cole (1999; hereafter HC99), who suggest the fitting 
formula 



P2/0W = ^-270 [1 - (VAo)-'-' 



(15) 



Here P2/0 ^^e linear theory quadrupole distortion, related 
to f} by equation (1101 , and Ao is the non- linear scale at which 
the quadrupole passes through zero. We make two changes 
to the HC99 procedure, which, in our experiments, improve 
the accuracy and robustness of the /3 estimates. First, we 
calculate Ao by fitting a straight line to the six data points 
surrounding P2/0 ~ 0, instead of leaving it as a fitting pa- 
rameter in the global fit. Second, we modify equation 1151 
to 



P2/0W — P2/() 



-1. 55/(0.45-1- P"°) 



(16) 



We determine the fitting parameter Pj/o by minimizing 
for all data points with A > Ao, ignoring any covariance 
of errors, and we then solve for /3fit using equation 1101 . 
Since Pj/o varies around ~ 0.55, the exponent in equation 
l|16|l is similar to that in HC99's formula, but including a 
dependence on P2/0 captures the behavior seen in Figure 1^1 
where the P2/0 curves for higher P models flatten toward 
their asymptotic values at larger scales. 

We use a similar procedure to estimate /3 from Po//j(A). 
Here we define the non-linear scale Ai as the wavelength at 
which Pz ~ Po , and we determine it by fitting a straight line 
to the six data points around Po/r = 1- We fit the functional 
form 



Po/rW = {PoTr - 1) 



1 - 



4.3 Estimating /3 

The a„ = curves in Figures |HJi, Hi, [Mi, and [TTJi show 
that /3 can be estimated fairly accurately using linear the- 
ory if FOG distortions are removed by suppressing velocity 
dispersions in virialized halos. However, these curves repre- 
sent a case in which FOG compression is perfect, with halos 
identified in real space from the densely sampled dark mat- 
ter distribution. Any realistic scheme must operate on the 
sparsely sampled galaxy distribution in redshift space, and 
it will suffer from incompleteness and contamination of the 
halo catalog and incorrect assignments of galaxies to ha- 
los. The impact of these imperfections on /3 estimates must 
be evaluated in the context of a specific group identifica- 
tion scheme applied to a survey with specified depth and 
geometry, and we will not consider the FOG compression 
approach further in this paper. Instead, we will use our nu- 
merical results to devise fitting procedures that estimate /3 
and a characteristic non-linear scale from measurements of 
P2/o{k), Po/R{k), ^o/R{r), and Q^{r). In the remainder of 
the paper, we use the notation Pat to represent a value of /3 
estimated by one of these fitting procedures, and use /3 to 
represent the true model values of /bg. The forms of our for data points with r ^ Rq. 



^ . -1.57/{0.50+ftit) 



+ 1, (17) 



where Pat and Pq^^ are related by equation (|5J . We estimate 
/3flt by minimizing for all data points with A ^ Ai. As 
with equation 1161 . the form of the exponent captures our 
numerical finding that higher /3 models approach asymp- 
totic behavior more slowly. In this case, we found that using 
Pfit rather than Pq^j^ in the denominator of the exponent 
produced more accurate results. 

For Q^, we adopt the fitting function 



Qdr) = Q 



1.45 

1^ 



0.75{l~r/Ra)' 



(18) 



where once again Q^'" is the free parameter and its relation 
to /3flt is defined in equation 1131 . The parameter Po is the 
scale at which £,2{r) = 0. Since the data for are much 
smoother than those for the power spectrum diagnostics, it 
is sufficient to fix Po by simple interpolation between the two 
points surrounding = 0. We determine Pat by minimizing 
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X [h-'Mpc] A [h-'Mpc] 



Figure 8. Monopole-to-real space ratio of the power spectrum, Pq/Pr, as a function of wavelength, up to the fundamental mode of 
the 253 /i~^Mpc box. The dotted lines represent the linear theory prediction for each model for this measure. The error bars plotted at 
the bottom of each panel are errors in the mean for the central model, which are generally representative of the error bars for the rest 
of the models in each set. Models run from trg = 0.95 (black) to erg = 0.6 (cyan) in panel (a), from Qrn = 0.1 (black) to Vlrn = 0.5 (cyan) 
in panel (b), and from eg = 0.95 (black) to erg = 0.6 (cyan) in panel (c), with the order of the colors being black, red, green, blue, cyan. 
In panel (d), models are Qi, = (black), a„ = 0.8 (red), a„ = 1.0 (green), cxy = 1.2 (blue), and a^c = 0.2 (cyan, barely separable from 
green), and a^c = 1 (magenta). 



For Co/Hi we find that the most effective method to esti- 
mate P is simply to fit a straight line to all data above r = IQ 
/i~^Mpc, and calculate /3flt from linear theory. A minimum 
scale below 10 /i~^Mpc allows non-linearities to affect the fit, 
while a larger minimum scale reduces the precision because 
the error bars increase monotonically with r. 

Figure fT?l presents the main quantitative results of this 
section, showing the fractional error e = (/3fit — /3)//3 of the 
/3 estimates from P2/0J Po/r^ Q^y a^nd ^o/fli using the fitting 
procedures described above. For the left hand panels, we fit 
the curves shown in Figures (|HJ — IllL which are averaged 
over three projections of the five F = 0.2 simulations. Right 
hand panels show results of the same procedures for the 
F = 0.12 simulations. 

Squares represent the fixed [(Tg,a„] model sequence, 
with the fim range 0.1 — 0.5 producing /3 values from 0.24 
to 0.63 (see Table 2). The fixed [f2m,atj] sequence, shown 



by the triangles, spans a narrower range of 0.36 ^ /3 5C 0.53, 
since we limit erg to the range 0.6 — 0.95. Five-point stars rep- 
resent [/?, a„] models, which all have /? = 0.46 by construc- 
tion. Hexagons represent the a„ = 0.8 and = 1.2 models 
from the fixed [fimjCrg] sequence. The q„ = 1.0 model is the 
same as the central model by definition, and the model with 
Que = 0.2 is indistinguishable from it in practice, so we omit 
it from the plot. The a^c ~ 1 model is shown with the small 
filled circle (left panels only). We do not show results for the 
FOG compression model because our fitting procedures do 
not apply to it. 

For the fixed [(Tg,a„] sequence, we calculate the statis- 
tical uncertainty in our estimate of the fractional error e by 
separately fitting the five simulations in turn, then dividing 
the dispersion of the fiat values by \/5 — 1 = 2 to obtain the 
uncertainty in the mean. These uncertainties are shown by 
error bars on the squares in Figure [T^ In many but not all 
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Figure 9. Quadrupole-to-monopole ratio of tlie redsliift space power spectrum, P2/P0, as a function of wavelengtli, up to the fundamental 
mode of tlie 253 /i~^Mpc box. The dotted lines and error bars are as in Figure|S] Models run from erg = 0.95 (black) to erg = 0.6 (cyan) 
in panel (a), from Q^n = 0.1 (black) to = 0.5 (cyan) in panel (b), from trg = 0.95 (black) to erg = 0.6 (cyan) in panel (c), with the 
order of the colors being black, red, green, blue, cyan. In panel (d), models are a„ = (black), = 0.8 (red), a„ = 1.0 (green), cty = 1.2 
(blue), and aye = 0.2 (cyan, barely separable from green), and a„c = 1 (magenta). 



Table 3. Errors in /3flt 





r = 


0.2 


r = 0.12 




Method 


(£2>l/2 [%] 


[%] 


{e2>l/2 [%] 


[%] 


P2/0 


4.4 


1.9 


4.2 


1.9 


Po/R 


1.3 


0.6 


1.8 


-0.7 




4.3 


0.7 


3.5 


1.7 


(o/R 


6.1 


5.9 


11.3 


10.9 


Lin+Exp 


9.4 


4.2 


18.4 


14.6 


HC99 


14.9 


14.0 


17.3 


16.5 



cases, our measurement of the bias in /3fit for a given model is 
consistent with zero, or only marginally inconsistent with it. 
However, even when the offsets from zero are within the er- 
ror bars, the trend with model parameters along a sequence 
may be significant, since all of our models are based on the 
same set of simulations. The total volume of our simula- 



tions is 5 X (253/1 ^Mpc)'^, equivalent to that of redshift 
survey covering 8000 square degrees to a limiting depth of 
460 ft-^Mpc. Since the three orthogonal projections sample 
different random orientations of the large scale structures 
in each simulation, the effective volume is somewhat larger, 
though the increase is not a full factor of three because real- 
space structures are the same in each projection. The error 
bars in Figure [T?l are therefore similar in magnitude to the 
statistical error expected from the full SDSS redshift survey, 
which will cov er 8000 square degre es with a median galaxy 
redshift ~ 0.1 dStrauss et al.ll2002l) . 

Table 3 summarizes the performance of the four /3- 
estimators, listing the mean and rms value of the fractional 
errors plotted in Figure^] Note, however, that the numbers 
depend on the particular set of models we have chosen, so 
they are only a rough indicator. For ^o/fli our procedure of 
fitting a straight line to the measurements above 10 /i~^Mpc 
gives a precise but not accurate value of Pat, as seen earlier 
in Figure nUl The mean offset is 5.9% for F = 0.2 and 10.9% 
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Figure 10. Ratio of the monopole of the redshift-space correlation function to the real-space correlation function, ^o/Hi ^.s a function 
of separation r. The dotted lines and error bars are as in Figure|H] Models run from erg = 0.95 (black) to o-g = 0.6 (cyan) in panel (a), 
from = 0.1 (black) to Qrn = 0.5 (cyan) in panel (b), from erg = 0.95 (black) to erg = 0.6 (cyan) in panel (c), with the order of the 
colors being black, red, green, blue, cyan. In panel (d), models are a„ = (black), = 0.8 (red), o„ = 1.0 (green), av = 1.2 (blue), and 
eivc = 0.2 (cyan, barely separable from green), and a^c = 1 (magenta). 



for r — 0.12. The rms values of e are only slightly larger, 
consistent with the small scatter around the the mean offset 
seen in Figure WIl though for F = 0.12 there is a weak but 
clearly significant trend of e with /3. Increasing the minimum 
fit radius above 10 h~^Mpc reduces the correlation but does 
not eliminate the higher mean error. 

The Pq/h fits yield accurate /3 estimates, with mean 
errors of less than 1% that are within the statistical uncer- 
tainty of our calculations. The rms errors are only 1.3% and 
1.8% for F = 0.2 and F = 0.12, respectively. Velocity bias 
does have a noticeable effect on the Pq/r estimator, with 
±20% changes in producing =p2.4% changes in Pat- 

Errors for the quadrupole estimators P2/0 and are 
larger, in part because of our larger statistical uncertain- 
ties, but also because of stronger variation with model pa- 
rameters. Velocity bias has a significant impact on P2/0J 
with ±20% changes in a„ producing =p9% changes in Pat 
for F = 0.2. For the effect is smaller, =f5.5%. The slope 
traced by the triangular points shows that the bias of the 



estimator changes steadily with erg, from —6% at erg = 0.6 
to +8% at erg = 0.95 for F = 0.2. A similar trend with erg 
appears in the constant-/3 sequence. 

For comparison, the lower panels of Figure show the 
results of applying the HC99 and linear-exponential models 
to our simulation results. The HC99 procedure is applied to 
P2/0 measurements with P2(k) ^ 0, and we implemented 
the linear-exponential model by minimizing with respect 
to ^(rcr, r-^) for all data with r ^ 5 /i~^Mpc. Note the larger 
vertical scale on these panels. The HC99 simulations empha- 
sized values of 13 > 0.6, and for /3 > 0.5 we also find it to 
be fairly accurate, with a bias ~ 10%. However, for lower (3 
values the HC99 procedure substantially overestimates the 
true /3, and our modification defined by equation (1161 is a 
major improvement. 

The linear-exponential model performs reasonably well 
for F — 0.2, but there is a steady trend from positive bias at 
low (3 to negative bias at high (3, and the rms error of 9.4% is 
substantially larger than for any of our estimators. Increas- 
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Figure 11. Quadrupole moment of the correlation function, = ^2/(^0 — 5o)i as a function of separation. The dotted lines and error 
bars are as in Figure 1^ Models run from erg = 0.95 (black) to erg = 0.6 (cyan) in panel (a), from Qrn = 0.1 (black) to = 0.5 (cyan) 
in panel (b), from erg = 0.95 (black) to trg = 0.6 (cyan) in panel (c), with the order of the colors being black, red, green, blue, cyan. In 
panel (d), models are a„ = (black), o^, = 0.8 (red), a„ = 1.0 (green), av = 1-2 (blue), and a^c = 0.2 (cyan, barely separable from 
green), and o„c = 1 (magenta). 



ing the minimum fitting scale from 5 /i~^Mpc to 10 h~^Mpc 
makes little difference. For F = 0.12 the linear-exponential 
model breaks down more seriously, overestimating /3 by up 
to 40%, and showing strong correlation of the /3flt error with 
f3 and with ag. 

By determining non-linear scales directly from the data, 
our /3-fitting procedures avoid any explicit dependence on 
as, f2m, or a„. Or course, for known values of erg or a„, one 
could use Figure lT^ to remove the bias of the estimator, fur- 
ther improving its accuracy. Our fitting formulas 11611 — HSU 
are obtained empirically, with only a qualitative relation to 
a full physical model. However, they successfully describe 
models with a wide range of physical parameters, and we 
will show in §4.5 below that the non-linear scales in these 
fits depend on erg, flm, and a« in physically sensible ways. 

The estimates based on redshift-space to real-space ra- 
tios, Pq/h and fo/fli perform more robustly than those in- 
volving quadrupole moments, once the linear theory esti- 
mate from Co/ii is corrected for systematic bias. Further- 



more, the monopole components Po{k) and ^o('') can be 
measured with higher precision than the quadrupoles P2{k) 
and €,2(t'), for a data set of fixed size. However, we have not 
addressed the proble m of determining the re al-space quanti- 
ties Pfl(fc) and fBfrl. lHamilton et alJ ll200d) propose meth- 
ods for recovering the former, by combining the monopole, 
quadrupole, and hexadecapole on large scales, and using the 
power in modes transverse to the line of sight on small scales. 
For £,R(r), one can invert the projected correlation func - 
tion wp(rp) fsee lDavis fc Peeblesll983l:IZehavi et"aLll2004bD . 
Alternatively, having fit Wp(rp) with an HOD model, one 
can take the three-dimensional correlation function of that 
model to represent ^R{r). It is possible that estimating 
PR(k) or ^R{r) in these ways will degrade the performance of 
the redshift-to-real space estimators, introducing systematic 
errors or larger statistical errors. We leave that question to 
future work that involves mock catalogs tailored to specific 
data sets. 
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Figure 12. The relative errors in /3fit obtained from the fitting functions, for models with F = 0.2 (left) and V = 0.12 (right). In each 
panel, the solid lines represent zero error, while the dotted lines are ±10% error. The four different point types correspond to the four 
model sets in Table 2: constant [f2m,a„] (squares), constant [o'8,a„] (triangles), constant [/3, a„] (stars), constant [HmiCrg] (hexagons). 
The av = model is not shown. The a^jc = 1 model is plotted with a solid circle (left panels only). The bottom panels show, with an 
expanded vertical scale, the systematic errors of the linear-exponential model and the HC99 fitting method. 



4.4 From (i and Q,m. to tjg 



With sufficiently good observational data, the procedures 
described in §4.3 can provide estimates of /3 = /bg 
that are accurate to a fenr percent or better. For a speci- 
fied value of Qm, this estimate in turn yields an estimate 
of bg. However, for cosmological purposes we are less in- 
terested in bg per se than in the dark matter fluctuation 
amplitude erg. In this paper we define bg to be the mean 
value of l^g{r)/^m{r)Y^'^ over the range 4 /i"^Mpc r 12 
/i~^Mpc, where the average is inverse variance weighted and 
(,m{r) is the non-linear correlation function of the simulation 
dark matter particles. The value of bg is insensitive to in- 
creases in the inner or outer cutoff on the averaging regions, 
though it drops if the minimum radius is pushed much below 
4 h~^Mpc. For example, changing the range to 10 /i~^Mpc 
^ r ^ 25 h~^Mpc, changes bg of the central model from 
1.041 to 1.026, the largest change of the five models. 



The standard analytic approximation for the large-scale 
bias factor, 

fe. = -y^ 6.(Af)(7V)M^dM, (19) 

describes our numerical results for bg with an rms error of 
0.4% for r = 0.2 and 0. 6% for F = 12, if we use the 
halo bias formula bh (M) of|Tinke^^ ai] ||2004) and the halo 
mass function dn/dM of H^nkin^^^n |20Mr l. The bias bg 
is a monotonically decreasing function of ag , since we match 
the same galaxy correlation function by construction. The 
most robust way to convert a value of /3flt to a value of erg 
(for a specified Qm) is to consider a sequence of models of 
increasing ag, carry out HOD fits to match the observed 
projected correlation function mp(r-p) in each case, compute 
bg from {N)m using equation 119^ . and pick the value of erg 
for which /bg = /3flt. 

By definition, erg is given by an integral over the linear 
theory dark matter power spectrum Piin(fc). In the linear ap- 
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Figure 13. Comparison of bias definitions. Curves sliow tlic 
square root of the ratio of the non-Hnear galaxy and Hnear mat- 
ter power spectra, for the five erg values as indicated. Error bars 
shown for the eg = 0.95 model are the error in the mean from 
five realizations. Thin lines are the value of bg measured from the 
correlation function ratio and used in our definition of /3. 



proximation, where — Pg{k) / Piin{k), one can use an esti- 
mated bg and the measured galaxy power spectrum Pg{k) to 
normalize Piin(fc) and thus compute as- Figure ITsl compares 
our definition of bg (horizontal lines) to the power spectrum 
ratios [Pg{k)/ Pnnlk)]^^^ of the T = 0.2 simulations. For all 
five values of erg, the power spectrum ratios are consistent 
with a constant asymptotic value at large scales, and this 
asymptotic value is consistent with the value of bg defined 
from the correlation function ratio. However, even with our 
360^ simulations, we cannot make this comparison at a pre- 
cision better than a few percent because there are relatively 
few Fourier modes in the asymptotic regime. Furthermore, 
the power spectrum ratios lie slightly above bg for erg = 0.6 
and slightly below for erg — 0.95, with a steady trend in be- 
tween. The same trend appears in Table 1, where the prod- 
uct asbg rises from 0.81 to 0.88 as as grows from 0.6 to 0.95. 
Thus, simply normalizing Piin(fc) by Pg{k)/b^ would not ac- 
curately describe our results at the few percent level. The 
trend of asbg arises because we set our HOD parameters by 
fitting the galaxy correlation function in the linear and non- 
linear regime; at the few percent level, our large-scale galaxy 
correlation function is higher for high as (see Figure |5J. If 
we forced a perfect match of the galaxy correlation function 
at large scales, then crg6g would be constant, but we could 
no longer match £,R{r) as well at small scales, at least with 
our three-parameter HOD. 

The passage from /3 and to as would be easy if we 
defined the galaxy bias bg = bs = as,g/as, where as,g is the 
(non-linear, shot noise subtracted) rms galaxy count fluctu- 
ation in 8 ^~^Mpc spheres. In this case, one could simply 



Figure 14. Infiuence of cosmological parameters on the non- 
linear length scales of the large-scale distortion measures, (a) The 
wavelength Ai at which Pq/Pr = 1 is plotted against /3crg''^a^. 
Large and small points show the F = 0.2 and F = 0.12 results, 
respectively, with point types corresponding to model sequences 
as indicated in the legend, (b) — (d) Non-linear length scales for 
the other three large-scale measures, plotted against the combina- 
tions of /3, (Tg, and a„ that produce the least scatter, for P = 0.2. 
Dotted lines show least squares fits to the data points. 

divide /3flt by and multiply by the measured as,g to 

obtain as- We have tried to develop procedures like those 
in §4.3 to estimate /3s = /bs- However, once we tune 
the estimation formulas to the F = 0.2 simulations, they do 
not provide accurate results for F = 0.12, in contrast to our 
procedures for /3, which give accurate results for both power 
spectrum shapes. An 8 /i~^Mpc top-hat does not suppress 
non-linear clustering enough for the bias factor bg to approx- 
imate bias in the linear regime (as also noted by HC99). 

In Paper II, we develop an analytic approach that cir- 
cumvents the complication of mapping (3 into the erg — 
parameter space, as the fitting parameters are (f2m, erg, a„), 
without reference to /3. 



4.5 Length Scales in Large-Scale Distortions 

The distortions in redshift-to-real space and quadrupole-to- 
monopole ratios in Figures |H| and 1111 are driven mainly by 
galaxy velocity dispersions on small and intermediate scales, 
which drive down the redshift-space correlation amplitude 
and reverse the sign of quadrupole distortions. The non- 
linear length scales in equations llOII . and (1131 . and the 
radius -Ri at which = £,r-, therefore encode information 
about the parameters Q,m, as, and q„, as an increase in 
any of these variables increases the galaxy velocity disper- 
sion. The dependence of the galaxy velocity dispersion on 
Qrn is straightforward: at fixed erg, the large-scale velocity 
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field follows the linear theory scaling , and the virial ve- 
locities of halos of fixed abundance scale as f2m^ (ignoring 
the small dependence of halo concentration on f2m). These 
two effects appear at different scales, but we find that the 
pairwise galaxy velocity dispersion scales roughly as fi^^^ in 
our simulations at all separations. For and a„ , the situa- 
tion is more complicated. Velocity bias is most influential at 
small scales, where the galaxy pairs come from within one 
halo. At larger scales, a significant fraction of pairs involve 
the central galaxies of low-mass halos, and are thus not af- 
fected by satellite velocity bias. Inspection of our numerical 
results suggests that at large separations the pairwise dis- 
persion scales as 0?^^. The power spectrum normalization 
affects the galaxy velocity dispersion in two ways: at linear 
scales the halo velocity dispersion increases linearly with o-g, 
while the internal velocity dispersions of halos hosting mul- 
tiple galaxies increase with erg because of the higher halo 
masses. 

Inspection of the a nalytic solution fo r Pq/r in the linear- 
exponential model (see lCole et aljFlQQSl §2.1) implies that 
the non- linear scale Ai where -Po/fl = 1 should scale linearly 
with the velocity dispersion cr„ at fixed /3 and approximately 



as /3 



-1/2 



at fixed (T„. With the scalings oc fi^ Qucrg 



discussed above, we obtain 

\ a-l/2 o0-25 0.5 

Al OC /J ' (Ti, (X i2„ (Tg «„, 



(20) 



where the last relation uses /3 cx (Tgf2^ . 

The left-hand panel of Figure 1141 plots Ai against 

1/2 9 

/3o-g Qf„, a combination of parameters chosen by trial and 
error to yield minimal scatter. The numerical data form a 
tight power-law for the F = 0.2 models. The statistical er- 
rors derived from the run-to-run dispersion are of order the 
point size, and the fit has a per degree of freedom of 
8.9, indicating that most of the model-to-model scatter is 
physical in origin. The data for the F = 0.12 models fol- 
low the same slope, but the amplitude of the relation is 5% 
higher, and there is more scatter. The dotted line plotted 
in the panel is a least squares fit to the F = 0.2 data. The 
slope is ~ 0.36, making a scaling of Ai = 26.3 ilm^^ag'^^aJ!'^^ 
/i~^Mpc. Given the approximate nature of the arguments 
behind equation 12011 . the agreement with the numerically 
derived scaling is quite good. The lower index on in the 
numerical results arises because the scale Ai ~ 10 /i^^Mpc 
is outside the one-halo regime where oc but not fully 

1/2 

in the large scale regime where cr^ oc . 

The remaining panels of Figure IT^ plot the other non- 
linear length scales against a combination of parameters cho- 
sen by trial and error to produce minimum scatter. For the 
central model, the P2/0 zero-crossing Ao is slightly smaller 
than Al, Ao is ~ 2 times the zero-crossing i?o, and Ro is 
~ 3 times the scale Ri at which ^o/fl = 1- E)otted lines 
show best-fit power-law relations, Ao = 20.7 (/^aga^)"'^*, 



Scatter for the 



Ro = 8.7(/3crgaS)°-^'', Ri = 2.9 (/JcTg/^Q, 
quadrupole length scales is consistent with the statistical er- 
rors (see panel b), which are larger for these measurements. 

In principle, these non-linear length scales can help de- 
termine cosmological parameters by adding another observ- 
able quantity to break degeneracies in our three-dimensional 
parameter space. For example, once j3 is fixed by the large- 



scale distortions, the measurement of Ai constrains the pa- 
rameter combination (Tg'^*a°'^^. Since the different length 
scales have different parameter dependencies, once can use 
combinations to isolate and erg. For example, the best-fit 
power laws imply 



Al 



26.9/i-iMpc 



Ri 



2.92/i-iMpc 



(21) 



The models with no velocity bias (a„ = 1) follow this rela- 
tion with an rms error of 3.8% and a mean error of —2.4%. 
For the models with a„ = 0.8, 1.2, equation (I2H predicts 
0.80 and 1.10 respectively. The power law fits for Rq and Ai 
yield 



erg = 



Ro 



8.71 /i-iMpc 



Al 



26.9/i-iMpc 



(22) 



The values of erg predicted with equation 12211 are accurate 
to within an rms error of 12.6%. 



5 SMALL-SCALE DISTORTION 

While the non-linear length scales give some measure of 
small-scale velocities, we can characterize these velocities 
more physically and more accurately by focusing on distor- 
tions at small r^, where they dominate. The traditional mea- 
sure of small-scale distortions is the pairwise velocity disper- 
sion, but this is not a direct observable; it is extracted from 
the data by fitting a model that specifies the scale depen- 
dence of the mean pairwise velocity of galaxies and the form 
of the velocity distribution (e.g., Davis & Peebles 1983). We 
would prefer a quantity that is measured directly from the 
data, and here we follow the lead of Fisher et al. (1994), who 
use 5(r^) at fixed, small r^. Referring back to Figure|Kl we 
see that ^(r.^) at small is constant for a range of r-„, before 
turning over at a scale determined by the galaxy velocity dis- 
persion. We can quantify this turnover by the measure 
the value of at which the correlation function decreases 
by a factor of two relative to its value at = 0. More gen- 
erally, one could use the shape of €,{r-n) /(^ir-n = 0) over some 
range of the line-of-sight separation, scaling by ^(r.^- = 0) to 
remove the sensitivity of the distortion measure to the exact 
value of the real-space correlation function. 

Figure IT^ plots against Tc for the [Omjau] se- 
quence. All the curves have a characteristic wave pattern, 
which rises to a maximum at ~ 0.6 /i~^Mpc and reaches 
a minimum at ro- ~ 1 /i~^Mpc. The rise at small separa- 
tion is the result of including one-halo galaxy pairs from 
increasingly more massive halos with higher velocity disper- 
sions. The minimum at 1 /i~^Mpc occurs near the one-halo 
to two-halo transition in the real-space £,R{r). At this separa- 
tion, two-halo pairs come largely from the central galaxies of 
lower mass halos, so they do not have an internal dispersion 
contribution, and the halo pairwise velocities themselves are 
relatively low. At > 1 /i~^Mpc, all curves monotonically 
increase, as the internal dispersions of large halos again start 
to contribute and the pairwise dispersion of halos themselves 
increases. To highlight the differences between the models, 
panels (b) — (f) plot five model sequences where all the 
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Figure 15. The smalle-scale distortion parameter, ?'^/2, as a function of transverse separation. For panels (b) — (f), each curve has been 
normaUzcd by the values for the central model, while panel (a) plots the curves as log for the constant [Qrn,cev] sequence, without 
normalization. Models run from erg = 0.95 (black) to ag = 0.6 (cyan) in panels (a) and (b), from Q^n = 0.1 (black) to Q^n = 0.5 (cyan) 
in panel (c), from eg = 0.95 (black) to trg = 0.6 (cyan) in panel (d), with the order of the colors being black, red, green, blue, cyan. In 
panel (e), models are = (black), a„ = 0.8 (red), a„ = 1.0 (green), a„ = 1.2 (blue), aye = 0.2 (cyan), and a„c = 1 (magenta), (f) 
Models are the same as panel (b), but with F = 0.12. 



curves have been normalized by the values for the central 
model (flm = 0.3, ag — 0.8, a„ = l,Qt,c = 0). Panels (b) 
— (e) show the standard suite from Table 2 and earlier fig- 
ures. In panel (b), with fixed flm and a^, changing erg has 
little effect on r^y2 at Vcr ^ 0.2 /i~^Mpc. This separation is 
small enough that rare, high-mass halos do not contribute 
a large fraction of the one-halo galaxy pairs relative to the 
pairs contributed by halos with mass Mh ~ Mi, where erg 
has little effect on the halo mass function. The value of ug 
has a large impact on at ~ 1 h~^Mpc, the location 
of the one-halo to two-halo transition. More high mass ha- 
los create more large separation one-halo pairs, extending 
the one-halo (,R(r) to larger r. These pairs have large veloc- 
ity dispersion and are therefore spread out along the line of 
sight, increasing 

In panel (c), with fixed erg and a^, changing ^Im affects 
at all < 10 /i~^Mpc. Higher Qm increases both halo 
pairwise velocities and internal velocity dispersions, thus in- 



creasing on all scales where dispersion dominates over 
coherent flows. Panel (d) shows models with constant /3 and 
Q„, and thus constant large-scale anisotropy. As expected 
from the previous results, higher Qm models have larger r^^2 
at To- ~ 0.1 /i~^Mpc, where ag has little impact. At ro- ~ 1 
h~^Mpc, the higher flrn models (with lower erg) have smaller 
r^/2', the depression seen in panel (b) wins out over the en- 
hancement in panel (c). Thus, at fixed (3 and a^, the small 
scale distortions can break the degeneracy between Qrn and 
erg. 

Panel (e) shows models with varying but constant 
Qrn and ag (and thus constant /3). Not surprisingly, the 
Uv — model has very small values of r^/2 relative to the 
central model at scales less than 10 h'^Mpc. The effect of 
moderate velocity bias is most significant at the smallest To- , 
with 20% changes in r^/2 at = 0.1 fe~'^Mpc for a„ = 1.2 or 
0.8. However, these Ov variations have little impact at large 
Va, where 20% changes of internal velocity dispersions are 
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small compared to halo velocities themselves, and the effect 
is essentially zero at ~ 1 /i~^Mpc. At this separation, 
two-halo pairs begin to dominate ^(r-o-,r^), but is still 
smaller than the virial radii of large halos. Most pairs there- 
fore come from halos that contain a central galaxy and no 
satellites, and the value of has no effect. Central galaxy 
velocities have maximum effect at the ~ 1 /i~^Mpc scale, 
for the same reason. Setting a^c = 0.2 boosts by 5- 
10% at this ro-, while treating central galaxies like satellites 
(a^c = 1) boosts it by a factor of two. 

Panel (f) plots the results for the constant [nm,a„] se- 
quence with r = 0.12, once again normalized by the F = 0.2 
central model. As in panel (b), erg has minimal effect at small 
scales and makes the most difference at To- ~ 1 — 2 /i~^Mpc. 
The higher at large in the V = 0.12 models probably 
reflects the shallower real-space correlation function at these 
scales. 

Figure ITKl demonstrates that is a robust diagnostic 
for Q.m and when is small, independent of or F. 
In figure ITBk . the upper points plot r^/2(0.1) against SlmQ!^ 
for all of the F = 0.2 models (except those with = Q 
and a„c = 1). The data follow a power law with a slope of 
0.46 and minimal scatter. For one-halo pairs, the redshift- 
space separation depends on relative velocities, which are 

1/2 

proportional to Q,m oiv, and one might therefore expect a 
slope of 0.5. Because there is a small two-halo contribution 
to ^(ro-,r^) at these separations, the slope deviates slightly 
from this expectation. The data for F = 0.12 follow a simi- 
lar power law, but with a normalization ~ 7% lower, as ex- 
pected from the results in Figure This offset may arise 
partly from the difference in the real-space correlation func- 
tion, which is shallower for F = 0.12, and partly from the 
difference in the halo mass function, which changes the rel- 
ative importance of pairs from different halos. 

The values of (3 and rj/2(0.1) provide two observable 
constraints in our three-dimensional (nm,o"8,Qj,) parame- 
ter space, measuring the combinations agfim^ and ilmQ^. A 
measurement of r(^/2 at somewhat larger has the possi- 
bility of providing a third constraint on a different combi- 
nation of these parameters. Based on the power-law fit in 
Figure ll(ib . each constant-/? model was given the value of 
Uv required to match rj/2(0.1) of the central model. Rela- 
tive to Figure [TKh at fixed [/3, a„], this scaling brings curves 
together at < 1 /i~^Mpc, but it makes little difference 
at larger separations where a„ has little effect. Differentiat- 
ing between adjacent models requires high precision in the 
measurements, but there is a clear, 20% separation between 
the low and high values of ag, with this diagnostic. In Figure 
116b . we plot r^/2 against erg for Tc — 3,4, and 5 /i~^Mpc. 
At each transverse separation, there is a monotonic, nearly 
linear trend with ug once /3 and r^/2(0.1) have been fixed. 
These results allow for unambiguous determination of ag, 
breaking the third and last degeneracy in the parameter 
space. 

Figure IT^ assumes a^c ~ 0, and central galaxy veloc- 
ities could interfere with this approach to breaking degen- 
eracies. For example, adopting a^c = 0.2 increases rj/2(0.1) 
by ~ 5%, which is of order the effect of changing erg by 
0.1. However, the effects of moderate a^c on this measure 



go away at scales larger than 3 h ^Mpc, where there is still 
clear model differentiation in Figures IT5h and [Ht. As we 
have already noted, physical arguments and hydrodynamic 
simulations support the assumption of low a^c, but further 
theoretical and observational investigation of this point is 
warranted. 



6 DISCUSSION 

Our results provide a blueprint for obtaining constraints 
in the (f2,„,ag,a„) parameter space from measurements of 
clustering anisotropy in redshift space. For each model in 
the parameter space, one first chooses HOD parameters to 
reproduce measurements of the projected galaxy correlation 
function uip(rp), which depends only on the real-space corre- 
lation function £,R{r). If the assumed power spectrum shape 
is correct, it will generally be possible to match 'Wp{rp) well 
for a wide range of as and fim. At large scales, the anisotropy 
ratios Pq/r, P2/0, ^o/r or then depend on /? = Q'^'^/bg, 
where = ^g/^m is a monotonically decreasing function 
of (Tg for fixed galaxy clustering (see §4.4). These measures 
scale with cosmological paramet ers as predic ted by linear 
theory and the linear bias model jKaiseilll987D . even though 
these approximations do not provide an accurate description 
of anisotropy on most scales accessible to observations or to 
our simulations. One can estimate /3 by fitting Po/b., P2/0, 
or as a function of scale using our equations II6II . 11711 . 
and II8I I. or by measuring (,o/r at r ^ 10 /i~^Mpc and cor- 
recting for the ~ 6% bias of linear theory (see Figure [T^ . 
The turnover scales in the fitting functions depend on the 
velocity bias Qf„ , but they can be measured directly from the 
anisotropy ratios, so the j3 estimates themselves are largely 
independent of a„. 

The turnover scales can be used to break degenera- 
cies in the parameter space, but the line-of-sight correla- 
tion function ^(r^) at fixed, small ra provides a more di- 
rect measure of velocity distortions in the highly non-linear 
regime. In particular, for small the scale r^/2 defined by 
5(ro-,r^/2) = 0.5 X 5(rCT,0), quantifies the typical length of 
"fingers-of-god," and hence the characteristic amplitude of 
pairwise velocity dispersions. At Tc ~ 0.1 /i~^Mpc, where 
most pairs come from intermediate mass halos, we find that 
rj/2 depends on JlmQ^ with essentially no dependence on 
(Tg. At ~ 1 — 5 /i~^Mpc, rj/2 has a significant depen- 
dence on (Tg even at fixed P and a„, with Afrg ~ 0.1 cor- 
responding to Ar5/2 ~ 5%. Therefore, one can in princi- 
ple use measurements of large-scale anisotropy and ^(r^) 
at ro- ~ 0.1 — 5 /i~^Mpc to separately determine the val- 
ues of (Tg, and a„. Alternatively, one can measure (5 
and 0.mOL^ as described above and adopt theoretical priors 
on a„ from hydrodynamic simulations of galaxy formation 
(e.g., Berlind et al. 2003), or combine redshift-space distor- 
tions with other observables that constrain different combi- 
nations of (Tg and Q.m- For example, galaxy-galaxy lensing 
measurements constrain as^m (instead of ag,^^) from the 
ratio of the galaxy-mass correlation function to the galaxy 
autocorrelation function (Sheldon et al. 2004). The galaxy 
bispectrum can yield a direct estimate of erg by determin- 



© 2005 RAS, MNRAS 000, 000-000 



Redshift- Space Distortions with the HOD 23 




0.1 0.5 1 0.4 0.6 0.8 1 



Figure 16. Diagnostic power of tlie small scale distortion measure, (a) Points show rgy2(0-l) against fimOiy for all of the F = 0.2 
models, except a„ = and o^c = 1. The upper line shows a least-squares fit, r^y2(0.1) = 3.7(OmoJ)''''*®, with o / ~ '^'^^ lower 
points, shifted down by 0.2 dex for visual clarity, show the F = 0.12 results. These data lie ^ 7% below the (shifted) line, (b) Dependence 
of on for models in which both the large scale distortions (/3 = 0.46) and the small-scale distortions [rj/2(0.1) = 2.0 h~^Mpc] 
are fixed to the same values. Circles, squares, and triangles represent r^- =3,4 and 5 h~^Mpc, respectively. 



ing the large-scale galaxy bias factor (Fry 1994; Verde et al. 
2002). 

Our blueprint has significant advantages relative to the 
linear-exponential model or the alternative fitting procedure 
of HC99. First, our approach is more accurate for a wide 
range of cosmological models (Figs. 151 112II . Averaging over 
both values of F used, our fitting function for Pq/r yields /3 
with an rms error of 1.6% for the range of models presented. 
For the P2/0 a-nd diagnostics, the fitting functions yield 
rms errors of 4.1% and 3.9% respectively. Second, our ap- 
proach makes use of the small-scale anisotropy as a tool 
for breaking parameter degeneracies, instead of treating the 
galaxy dispersion as a nuisance parameter. Constraints on 
erg and av from these small scale measures can be used to 
further improve the (3 estimate. 

The fitting formulas presented here are designed to al- 
low straightforward parameter estimation given measure- 
ments of 5(rCT,r^) and Pz{k,n). Alternatively, one can 
use simulations to calibrate a fully analytic description of 
redshift-space anisotropy, in which case one can fit data di- 
rectly using f2,„, (Tg, and as the fitting parameters. We 
will develop such a model in Paper II; achieving the accu- 
racy demanded by data sets like the SDSS and the 2dFGRS 
is not easy, but it is possible. The analytic method is more 
flexible than the fitting formula approach, allowing one to 
take more complete advantage of information in ^(r^, r.^) or 
Pz{k,iJ.). At the opposite extreme, one can circumvent ana- 
lytic formulations entirely and fit data by directly populat- 
ing halos of N-body simulations and measuring anisotropy, 
using the f2m.-scaling technique of this paper to improve effi- 
ciency. With large volume simulations that resolve the nec- 
essary halo masses, this method should achieve the highest 
accuracy because it fully describes non-linear halo cluster- 
ing, and it can address corrections to the distant- observer 
approximation and other technical issues that are difficult 
to model analytically. In practice, it will probably be best 
to use the fitting formulas or an analytic model to locate 



the most interesting regions of parameter space, then use 
focused numerical simulations to check and refine estimates. 

For the a„ = model, large-scale anisotropy measures 
agree reasonably with linear theory over a substantial range 
in scale. This result suggests that FOG compression plus 
linear theory is a viable alternative approach to estimating 
(3. Assessing the systematic uncertainties of this method re- 
quires tests with realistic mock catalogs that quantify the 
ability of the FOG compression algorithm to correctly iden- 
tify and compress true FOGs in galaxy survey data. 

There are several limitations to our blueprint. With 
two exceptions, we have assumed that central galaxies move 
with the center of mass of the halo, i.e. a^jc ~ 0. Chang- 
ing Ovc to 0.2 makes minimal difference in both the large 
scale measures and However, setting a^c ~ 1 makes 

a considerable difference. Current hydrodynamical simula- 
tions (Berlind et al. 2003) suggest a„c J; 0.2 is a reasonable 
assumption, but the issue merits further investigation be- 
cause of its significant impact on redshift-space anisotropy 
modeling. Analysis of SDSS galaxies shows that central- 
satellite galaxy pairs indeed have a narrower velocity spread 
than satellite-satellite pairs (T. McKay et al., in prepara- 
tion). We have also assumed that Ov is independent of halo 
mass. This assumption should be adequate because most 
one-halo pairs come from a limited range of halo masses; low- 
mass halos have no satellites, and high-mass halos are rare. 
To significantly alter our results, a„ would need to depend 
strongly on mass in the relatively narrow range Mi/2 — 5Mi, 
and even then its effect might be well represented by an av- 
erage value. The weak mass dependence seen in the simula- 
tions analyzed by Berlind et al. (2003) does not affect the 
results here, but the question again merits investigation in 
future hydrodynamic studies of galaxy formation. One can 
also test for mass dependence of Ov by comparing the pre- 
dicted and observed scalings of group velocity dispersions 
with group richness. 

The experiments illustrated in Figure |7| show that 
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changing the details of the HOD, or the assumption about 
spatial bias within halos, has negligible impact on redshift- 
space distortions provided one matches the same real-space 
clustering. However, our investigation of these points is 
not exhaustive. Effects of changing q„c, making q„ mass- 
dependent, or changing HOD prescriptions while maintain- 
ing iR{r) can all be examined in more detail using the ana- 
lytic model of Paper H. 

The simulations presented in this work have less dy- 
namic range than is ideal. At the largest scales, our nu- 
merical predictions are less precise than the measurement 
precision achievable with 2dFGRS or SDSS data, though 
not by a large factor. We have focused on predictions 
for luminosity-threshold galaxy samples with space density 
5.6 X 10"'' (/i~^Mpc)~'', corresponding roughly to M,. < 
— 20 + 51og/i. To make predictions or test fitting formu- 
las for fainter galaxies, which occupy less massive halos, one 
would need higher resolution simulations but similar sim- 
ulation volumes. To get precise results for more luminous 
galaxies that reside in rare, massive halos, one would need 
larger simulation volumes, though the mass resolution re- 
quired is lower. The analytic model described in Paper II 
can easily be applied to samples with different luminosity or 
color selection and correspondingly different HODs, and it 
automatically extends to large scales. However, additional 
simulations will be needed to test the accuracy of the ana- 
lytic model in these regimes. 

The monopole-to-real space ratios, Pq/r and 5o/i?i 
have smaller systematic errors as estimators of /3 than the 
quadrupole-to-monopole ratios P2/0 a-nd Q5. However, we 
have not addressed the problem of estimating C-r(^) or Pnik) 
from data. Techniques for estimating these quantities exist 
(e.g., Davis & Peebles 1983; Hamilton et al. 2000; Zehavi 
et al. 2004b; Tegmark et al. 2004a), but we do not yet know 
whether they are accurate at the ~ 1 — 2% level required if 
they are not to contribute significantly to uncertainties in 
the estimates of /3. 

Despite these limitations, our results demonstrate that 
HOD modeling can substantially improve the accuracy and 
precision of redshift-space distortion analysis by replacing 
ad hoc extensions of linear perturbation theory with a com- 
plete, fully non-linear description of dark matter dynamics 
and galaxy bias. This level of sophistication is required to 
take full advantage of data provided by the 2dFGRS and 
SDSS. Precise cosmological constraints from galeixy cluster- 
ing complement those from other cosmological observables 
like CMB anisotropy, gravitational lensing, the Lyman-a for- 
est, or Type la supernovae. They thus enhance the opportu- 
nity to detect departures from the simplest ACDM model, 
which could provide insight into the physics of dark energy 
or the origin of primordial fiuctuations. 
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